Visualization of protein structure relationships us- ing constrained twin kernel embedding

نویسندگان

  • Yi Guo
  • Jun-Bin Gao
  • Paul Wing Hing Kwan
  • Kevin Xinsheng Hou
چکیده

In this paper, a recently proposed dimensionality reduction method called Twin Kernel Embedding (TKE) [10] is applied in 2-dimensional visualization of protein structure relationships. By matching the similarity measures of the input and the embedding spaces expressed by their respective kernels, TKE ensures that both local and global proximity information are preserved simultaneously. Experiments conducted on a subset of the Structural Classification Of Protein (SCOP) database confirmed the effectiveness of TKE in preserving the original relationships among protein structures in the lower dimensional embedding according to their similarities. This result is expected to benefit subsequent analyses of protein structures and their functions.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A THEORETICALLY CORRECT RESOURCE USAGE VISUALIZATION FOR THE RESOURCE-CONSTRAINED PROJECT SCHEDULING PROBLEM

The cumulative resource constraints of the resource-constrained project scheduling problem (RCPSP) do not treat the resource demands as geometric rectangles, that is, activities are not necessarily assigned to the same resource units over their processing times. In spite of this fact, most papers on resource-constrained project scheduling mainly in the motivation phase use a strip packing of re...

متن کامل

Incorporating homologues into Sequence Embeddings for protein Analysis

Statistical and learning techniques are becoming increasingly popular for different tasks in bioinformatics. Many of the most powerful statistical and learning techniques are applicable to points in a Euclidean space but not directly applicable to discrete sequences such as protein sequences. One way to apply these techniques to protein sequences is to embed the sequences into a Euclidean space...

متن کامل

Learning Random Fourier Features by Hybrid Constrained Optimization

The kernel embedding algorithm is an important component for adapting kernel methods to large datasets. Since the algorithm consumes a major computation cost in the testing phase, we propose a novel teacher-learner framework of learning computation-efficient kernel embeddings from specific data. In the framework, the highprecision embeddings (teacher) transfer the data information to the comput...

متن کامل

Embedding of a 2D Graphene System in Non-Commutative Space

The BFT approach is used to formulate the electronic states in graphene through a non-commutative space in the presence of a constant magnetic field B for the first time. In this regard, we introduce a second class of constrained system, which is not gauge symmetric but by applying BFT method and extending phase space, the second class constraints  converts  to the first class constraints so th...

متن کامل

Gaussian Mixture Models with Component Means Constrained in Pre-selected Subspaces

We investigate a Gaussian mixture model (GMM) with component means constrained in a pre-selected subspace. Applications to classification and clustering are explored. An EM-type estimation algorithm is derived. We prove that the subspace containing the component means of a GMM with a common covariance matrix also contains the modes of the density and the class means. This motivates us to find a...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2008